IN THE CLAIMS: 

Please amend the claims as follows: 



1 . (Currently Amended) A method of voice recognition, comprising the steps of: 

organizing a plurality of speaker data points, representing a plurality of enrollment 
speakers, into a data structure using high-dimensional vectors that represent 
characteristics of enrollment voice samples from the enrollment speakers; 
estimating a probability density function of a subset of the plurality of speaker data 
points using Parzen windows, wherein the subset comprises comprising the 
approximate nearest neighbors to an unidentified voice sample from an 
unidentified speaker , the subset not including all speaker data points in the 
plurality of speaker data points ; and 




identifying the unidentified speaker based on one or more speaker data points most 
closely matching the unidentified voice sample as indicated by the estimated 
probability density function . 

2. (Cancelled) 

3. (Currently Amended) The method of claim 1, wherein the step of estimating the 
probability density comprises estimating the density based on a distance between individual 
speaker data points within the subset of speaker data points. 

4. (Currently Amended) The method of claim 1, wherein the step of estimating the 
probability density further comprises controlling the relative contributions of individual 
speaker data points within the subset of speaker data points to the density based on a distance 
to a speaker data point from the unidentified voice sample. 

5. (Currently Amended) The method of claim 1, wherein the step of estimating the 
probability density comprises estimating the density of the subset of speaker data points 
independent of parametric distribution information related to the plurality of speaker data 
points. 
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6. (Original) The method of claim 1, wherein the data structure module organizes the 
plurality of speaker data points such that a distance between individual speaker data points is 
based on characteristic similarities between associated voice samples, the distance measured 
in terms of one from the group containing: a Euclidean distance, a Minkowski distance, and a 
Manhattan distance. 

7. (Original) The method of claim 1, wherein the data structure comprises a kd-tree. 

8. (Original) The method of claim 1, wherein the plurality of speaker data points 
comprises a relatively large number of speaker data points. 

9. (Original) The method of claim 1 , further comprising a step of retrieving the subset 
of speaker data points using an unidentified speaker data point from the unidentified voice 
sample as an index into the plurality of speaker data points. 

10. (Original) The method of claim 8, wherein the step of retrieving the subset of speaker 
data points comprises retrieving approximate nearest neighbors to the unidentified speaker 
data point, the approximate nearest neighbors comprising speaker data points within a 
distance calculated as a function of a distance of an absolute nearest neighbor. 

1 1 . (Original) The method of claim 1 , wherein the subset of speaker data points includes 
more than one speaker data points associated with a common identification, and the step of 
identifying the unidentified speaker accumulates a score for the common identification. 

12. (Original) The method of claim 1, further comprising extracting the high- 
dimensional vectors from the enrollment voice samples and the unidentified voice sample. 

13. (Original) The method of claim 1, wherein the step of identifying the unidentified 
speaker comprises identifying the unidentified speaker as one of the enrollment speakers if 
matching is within an error threshold. 

14. (Original) The method of claim 1, wherein an enrollment voice sample and the 
unidentified voice sample of a common speaker are text-independent. 

15. (Currently Amended) A method of voice recognition, comprising the steps of: 
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retrieving a subset of speaker data points by using an unidentified speaker data point 
as an index into a data structure comprising a plurality of speaker data points, 
the subset of speaker data points representing approximate nearest neighbors 
to the unidentified speaker data , the subset not including all speaker data 
points in the plurality of speaker data points ; 

estimating a probability density function from a subset of the plurality of speaker data 
points using Parzen windows ; and 

identifying the unidentified speaker based on one or more speaker data points most 

closely matching the unidentified voice sample as indicated by the probability 
density function. 

16. (Cancelled) 

17. (Currently Amended) A voice recognition system, comprising: 

means for organizing a plurality of speaker data points, representing a plurality of 

enrollment speakers, into a data structure using high-dimensional vectors that 
represent characteristics of enrollment voice samples from enrollment 
speakers; 

means for estimating a density of a subset of the plurality of speaker data points using 
Parzen windows, wherein the subset comprises comprising the approximate 
nearest neighbors to an unidentified voice sample from an unidentified 
sp eaker , the subset not including all speaker data points in the plurality of 
speaker data points ; and 

means for identifying the unidentified speaker based on one or more speaker data 

points most closely matching the unidentified voice sample as indicated by the 
estimated density. 

18. (Cancelled) 

19. (Original) The system of claim 17, wherein the means for estimating estimates the 
density based on a distance between individual speaker data points within the subset of 
speaker data points. 
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20. (Original) The system of claim 17, wherein the means for estimating includes a 
smoothing parameter to control the relative contributions of individual speaker data points 
within the subset of speaker data points to the probability density function based on a 
distance to a speaker data point from the unidentified voice sample. 

21 . (Original) The system of claim 17, wherein the means for estimating estimates the 
density of the subset of speaker data points independent of parametric distribution 
information related to the plurality of speaker data points. 

22. (Original) The system of claim 17, wherein the means for organizing organizes the 
plurality of speaker data points such that a distance between individual speaker data points is 
based on characteristic similarities between associated voice samples, the distance measured 
in terms of one from the group containing: a Euclidean distance, a Minkowski distance, and a 
Manhattan distance. 

23. (Original) The system of claim 17, wherein the means for organizing comprises a kd- 
tree. 

24. (Original) The system of claim 17, wherein the plurality of speaker data points 
comprises a relatively large number of speaker data points. 

25. (Original) The system of claim 17, further comprising means for retrieving the subset 
of speaker data points uses an unidentified speaker data point from the unidentified voice 
sample as an index into the plurality of speaker data points. 

26. (Original) The system of claim 25, wherein the means for retrieving the subset of 
speaker data points retrieves approximate nearest neighbors to the unidentified speaker data 
point, the approximate nearest neighbors comprising speaker data points within a distance 
calculated as a function of a distance of an absolute nearest neighbor. 

27. (Original) The system of claim 17, wherein the subset of speaker data points includes 
more than one speaker data points associated with a common identification, and the 
identification module accumulates a score for the common identification. 
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28. (Original) The system of claim 17, further comprising a means for extracting the 
high-dimensional vectors from voice samples. 

29. (Original) The system of claim 17, wherein the means for identifying identifies the 
unidentified speaker as one of the enrollment speakers if matching is within an error 
threshold. 

30. (Original) The system of claim 17, wherein an enrollment voice sample and the 
unidentified voice sample of a common speaker are text-independent. 

3 1 . (Currently Amended) A computer program product, comprising: 

a computer-readable medium having computer program instructions and data 
embodied thereon for voice recognition, comprising the steps of: 
organizing a plurality of speaker data points, representing a plurality of 
enrollment speakers, into a data structure using high -dimensional 
vectors that represent characteristics of enrollment voice samples from 
the enrollment speakers; 
estimating a probability density function of a subset of the plurality of speaker 
data points using Parzen windows, wherein the subset comprises 
comprising the approximate nearest neighbors to an unidentified voice 
sample from an unidentified speake r, the subset not including all 
speaker data points in the plurality of speaker data points ; and 
identifying the unidentified speaker based on one or more speaker data points 
most closely matching the unidentified voice sample as indicated by 
the estimated probability density function . 

32. (Cancelled) 

33 . (Currently Amended) The computer program product of claim 3 1 , wherein the step 
of estimating the probability density comprises estimating the density based on a distance 
between individual speaker data points within the subset of speaker data points^ 
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34. (Currently Amended) The computer program product of claim 3 1 , wherein the step 
of estimating the probability density further comprises controlling the relative contributions 
of individual speaker data points within the subset of speaker data points to the probability 
density function based on a distance to a speaker data point from the unidentified voice 
sample. 

35 . (Currently Amended) The computer program product of claim 3 1 , wherein the step 
of estimating the probability density comprises estimating the probability density function of 
the subset of speaker data points independent of parametric distribution information related 
to the plurality of speaker data points. 

36. (Original) The computer program product of claim 3 1 , wherein the data structure 
module organizes the plurality of speaker data points such that a distance between individual 
speaker data points is based on characteristic similarities between associated voice samples, 
the distance measured in terms of one from the group containing: a Euclidean distance, a 
Minkowski distance, and a Manhattan distance. 

37. (Original) The computer program product of claim 31, wherein the data structure 
comprises a kd-tree. 

38. (Original) The computer program product of claim 3 1 , wherein the plurality of 
speaker data points comprises a relatively large number of speaker data points. 

39. (Original) The computer program product of claim 3 1 , further comprising a step of 
retrieving the subset of speaker data points using an unidentified speaker data point from the 
unidentified voice sample as an index into the plurality of speaker data points. 

40. (Original) The computer program product of claim 38, wherein the step of retrieving 
the subset of speaker data points comprises retrieving approximate nearest neighbors to the 
unidentified speaker data point, the approximate nearest neighbors comprising speaker data 
points within a distance calculated as a function of a distance of an absolute nearest neighbor. 

4 1 . (Original) The computer program product of claim 3 1 , wherein the subset of speaker 
data points includes more than one speaker data points associated with a common 
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identification, and the identification module accumulates a score for the common 
identification. 

42. (Original) The computer program product of claim 3 1 , further comprising extracting 
the high-dimensional vectors from the enrollment voice samples and the unidentified voice 
sample. 

43. (Original) The computer program product of claim 3 1 , wherein the step of 
identifying the unidentified speaker comprises identifying the unidentified speaker as one of 
the enrollment speakers if matching is within an error threshold. 

44. (Original) The computer program product of claim 3 1 , wherein an enrollment voice 
sample and the unidentified voice sample of a common speaker are text-independent. 
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